BioTagger: A Biological Entity Tagging System

نویسندگان

  • Hongfang Liu
  • Cathy Wu
چکیده

The results submitted for Task 1B is the outcome of the prototype system of an ongoing biological entity tagging system, called BioTagger, which decomposes the tagging task into several subtasks and considers novelty, synonymy and ambiguity associated with terms representing biological entities in text: • Automatic construction of a comprehensive dictionary for biological entities using online resources • Automatic acquisition of disambiguation knowledge from these resources • Intelligent dictionary lookup that considers novelty, synonymy, and ambiguity • Training a POS tagger using unsupervised machine learning techniques to further consider novelty and ambiguity, and training disambiguation classifiers to perform corpus-based disambiguation to further resolve the ambiguity

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Research Paper: BioTagger-GM: A Gene/Protein Name Recognition System

OBJECTIVES Biomedical named entity recognition (BNER) is a critical component in automated systems that mine biomedical knowledge in free text. Among different types of entities in the domain, gene/protein would be the most studied one for BNER. Our goal is to develop a gene/protein name recognition system BioTagger-GM that exploits rich information in terminology sources using powerful machine...

متن کامل

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Chinese Named Entity and Relation Identification System

In this interactive presentation, a Chinese named entity and relation identification system is demonstrated. The domainspecific system has a three-stage pipeline architecture which includes word segmentation and part-of-speech (POS) tagging, named entity recognition, and named entity relation identitfication. The experimental results have shown that the average F-measure for word segmentation a...

متن کامل

Named Entity Recognition System for Postpositional Languages: Urdu as a Case Study

Named Entity Recognition and Classification is the process of identifying named entities and classifying them into one of the classes like person name, organization name, location name, etc. In this paper, we propose a tagging scheme Begin Inside Last -2 (BIL2) for the Subject Object Verb (SOV) languages that contain postposition. We use the Urdu language as a case study. We compare the F-measu...

متن کامل

Correcting Word Segmentation and Part-of-speech Tagging Errors for Chinese Named Entity Recognition

In the exploration of Chinese named entity recognition for a specific domain, the authors found that the errors caused during word segmentation and part-ofspeech (POS) tagging have obstructed the improvement of the recognition performance. In order to further enhance recognition recall and precision, the authors propose an error correction approach for Chinese named entity recognition. In the e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004